How to Store and Query XML Data

نویسنده

  • Silvia Stefanova
چکیده

Due to the fact XML is a universal data-exchange format, all the questions concerning storage and querying XML documents have been lately of great popularity. Query languages, called XPath and XQuery have been developed and standardized in order to query XML data not only in XML files, but anything existing as XML, including databases. XML databases running XPath and XQuery are generally classified as native and relational. The native ones are specialized engines evaluating queries on XML documents, while the relational are built on top of existing database engines. The intention of this paper is to discuss some of the main problems and research trends from the bread research field devoted to XML storage and querying. The exposition will start with a description of XML Data Management Systems, go through the query languages XPath and XQuery, examine in details the aspects in the development of XML query processing and end up with the latest proposed algorithms for efficient query processing. XML Data Management Systems Existing architectures of XML database management systems will be presented and commented with respect to storing and querying opportunities of XML data. 1) Storage of XML documents in relational systems To store XML data in relational systems has been popular for the last several years and as well implemented by few commercial DBMS. The reason is that it is based on adaptation and reuse of relational technology. However, there are significant differences among the existing methods on how to save and hence query XML data in RDBMS. Shredding XML documents into relations XML documents are transformed into atomic values and after that stored as relations in relational tables. XQueries are translated to SQL queries to be evaluated by the RDBMS query processor. Several different shredding techniques and query capabilities has been proposed [14, 15]. The advantage of this method is that it does not require big modifications of the existing database engine. XML, stored as unparsed text XML data is stored in VARCHAR or LOB (large object) columns of relational tables. XML data is queried by a XQuery processor external to the database and invoked as a user-defined function. This approach is used by commercial systems offering XML support (MD2, MS SQL). This solution is relatively simple but the entire XML document usually has to be first loaded into the memory in order to be processed. Hybrid XML-relational databases XML documents are stored on disk pages in tree structures matching the XML data model [7, 9, 16]. Hence no mapping is needed between XML and relational structures. The native XML storage is complemented with XML indexes. It is provided XQuery and SQL support.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica

Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...

متن کامل

Mapping XML to inverted indexed circular linked lists

Extensible Markup Language (XML) has become the de facto standard for data exchange on the World Wide Web and is widely used in many fields, so it is urgent to develop some efficient methods to manage, store, and query XML data. Traditional methods use relational databases to store XML data which take advantage of mature technologies of relational databases. But it needs to map XML schemas to r...

متن کامل

Storing and Updating XML Data Tree based on Linked Lists

XML has become the de facto standard for data exchange and transformation on the World Wide Web and is widely used in many applications of various fields, so it is urgent to develop some efficient methods to manage, store, query, and update XML data. There are two main methods to do this: the first method is a native approach which uses native XML databases to store XML data, and the second met...

متن کامل

Survey of Mapping XML DTDs (Documents) to Relational Schemas

Extensible Markup Language(XML) [29], [1] is emerging as the data format of the web era. XML has the characteristics, such as the extensibility and self-description, which make it becomes a criterion of data exchanging between applications [9]. In recent years, XML has become the standard format of data representation and exchange in the scientific research and business application. For taking ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007